Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.to·1d
Discuss: DEV
⚡ Tokenizer Benchmarks
Semantic Dictionary Encoding
falvotech.com·2h
Discuss: Hacker News
🗂️ Type Indexing
Learn How to Use Transformers with HuggingFace and SpaCy
towardsdatascience.com·3h
📊 Pratt Parsers
Text-to-SQL Oriented to the Process Mining Domain: A PT-EN Dataset for Query Translation
arxiv.org·13h
🧠 Semantic Parsing
Building Writely with Kiro: From PRD to Browser Extension in Hours 🎉
youtu.be·6h
Discuss: DEV
💬 Interactive REPLs
Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.com·2h
Discuss: Hacker News
🚀 Tokenizer Performance
ECMAScript TC39 proposal-pattern-matching
github.com·3h
Discuss: Hacker News
🎯 Pattern Matching
I built an LLM from Scratch in Rust (Just ndarray and rand)
github.com·1d
🌱 Minimal ML
Baking with Rails at scale: recipes in Ruby, cookware from Go, C, and Rust
evilmartians.com·17h
💬 Smalltalk VMs
2.14.0 released
news.nononsenseapps.com·21h
📋 Backus-Naur Form
beline88 - ChileComparte
chilecomparte.cl·4h
🌉 Cross-Language Tools
LLM Rerankers for RAG: A Practical Guide
fin.ai·19h
🪜 Recursive Descent
Analyzing Lisp Redux: One Form At a Time
funcall.blogspot.com·2h
🔮 Lisp Interpreters
The Sacred Machine: Profane Artifact and Gateway to Truth
reddit.com·2h
Discuss: r/LLM
Tokenizers
LLM's Functions, Use-cases & Architecture: Introduction
dev.to·1h
Discuss: DEV
📊 LR Parsing
Glypht
glypht.valadaptive.dev·59m
🔄 Incremental Lexing
[1] Algorithm Showdown: Python vs. JavaScript - Group Anagrams
dev.to·20h
Discuss: DEV
🔗 Hash Functions
AI Tokenization Services
dev.to·7h
Discuss: DEV
Tokenizers
Latin Extended character set added!
tjtrewin.itch.io·3h
🔄 Incremental Lexing